23. References: Deep Neural Network ASR
Deep Speech 2
The following paper, presentation, and slides from Baidu on Deep Speech 2 were important resources for the development of this course and its capstone project:
- Amodei, Dario, et al. "Deep Speech 2: End-to-end speech recognition in English and Mandarin." International Conference on Machine Learning. 2016.
- Presentation
- Slides
Language modeling with CTC
Gram-CTC from Baidu on integrating a language model into CTC for better performance:
Language modeling with CTC based on weighted finite-state transducers (WFSTs):